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Detecting the risk of cardiovascular diseases by detecting 
mutations in genes, including genes encoding a2b-adrenpceptor 
and apolipoprotein B. 

The present invention provides a method of identifying subject's susceptibility to 
cardiovascular diseases or risk of developing myocardial infarction (MI) or 
5 cerebrovascular stroke by detecting gene polymorphisms and other gene mutations from a 
biological sample of the subject and optionally obtaining information concerning the 
family and medical history, blood, serum, plasma, urinary analytes and clinical findings of 
the subject. The invention also provides a multivariate model, a combination or algorithm 
of variables which best describes the probability of cardiovascular diseases, especially MI 
10 and stroke. The invention also relates to a test kit and software for accomplishing the 
method. 

FIELD OF THE INVENTION 

15 

The present invention is generally directed to a method for assessing the risk of 

Specifically, the invention is directed to a method that utilises both genetic and phenotypic 
information as well as information obtained by questionnaires to construct a score that 
20 provides the probability of developing an MI or stroke. Furthermore, the invention 

provides a kit for carrying out the method The kit can be used to set an etiology-based 
diagnosis of cardiovascular diseases for targeting of treatment and preventive 
interventions, such as dietary advice as well as stratification of the subject in clinical trials 
testing drugs and other interventions. 

25 

BACKGROUND OF THE INVENTION 

The coronary heart disease (CHD) and cerebrovascular disease are multifactorial diseases 
and the leading causes of morbidity, death and disability globally. Even though the age- 

30 standardized incidence of and mortality from CHD and stroke are still declining in the 
Western world, the number of cardiovascular events and subsequent hospitalizations and 
expenditure are increasing, due to the elevation of life expectancy of the population. It has 
been estimated based on twin and migration studies that the heritability of CHD and stroke 
is of the order of 50-60% and there are no major gene effects. Thus, multiple genes and 

35 non-genetic risk factors contribute to the development and progression of CHD. Different 
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clinical manifestations of CHD (i.e. angina pectoris, myocardial infarction, sudden death) 
and stroke have overlapping but also somewhat distinct pathophysiology and risk factors. 



CHD and stroke may be caused in different individuals by different reasons and through 
5 different pathophysiologic pathways. Often, however, the same risk factors and pathways 
are operating, but their importance for each individual varies. Regarding pathophysiology, 
CHD and stroke may be caused by obstruction of the coronary (cerebrovascular) arteries, 
vasoconstriction or vasospasm in these, thrombotic phenomena or arrhythmias. Coronary 
and cerebrovascular arterial obstruction is most often caused by atheroma formation. This 
10 is a complex disease, but lipids and their metabolism such as oxidation plays a key role. 
Other major factors leading to atheroma formation are tobacco smoking, hypertension, 
diabetes, obesity and hyperhomocysteinemia. Additional risk factors include elevated 
coagulation factors, platelet activation and decreased nitric oxid availability. Men, older 
persons and those with a family history of CHD are at elevated risk. 

15 

Persons who have mutations in genes regulating lipids, their metabolism, blood pressure, 
platelet functions, coagulation, fibrinolysis, homocysteine metabolism and the function of 

of these mutations can be used to predict MI and cerebrovascular stroke.. 

20 

A number of meta-analyses have studied multivariate risk functions from diverse 
populations in the prediction of CHD. None of these have concerned the effects of specific 
genotyped gene mutations. A recent meta-analysis concerned ordering risk, magnitude of 
relative risks, and estimation of absolute risk in prospective cohort studies (Diverse 

25 Populations Collaborative Group 2002). The outcome measure was death from CHD. The 
analysis included 105 420 men and 56 535 women 35-74 years of age and free of CHD at 
baseline from 16 observational studies with a total of 27 analytical groups. The area under 
the receiver operating characteristic curve (AUG) was used to judge the ability of the 
multivariate risk function to order risk correctly. The AUCs differed significantly between 

30 the studies (p < 0.01) but were very similar for different risk functions applied to the same 
population, indicating similar ability to rank risk for different models. The magnitudes of 
the relative risks associated with major risk factors (age, systolic blood pressure, serum 
total cholesterol, smoking, and diabetes) varied significantly across studies (p < 0.05 for 
homogeneity). The prediction of absolute risk was not very accurate in most of the cases 
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when a model derived from one study was applied to a different study. The authors 
concluded that when considered qualitatively, the major risk factors are associated with 
CHD mortality in a diverse set of populations. 

5 The new Sheffield table and modified joint British societies coronary risk prediction (JBS) 
chart are widely used (Rabindranath et al 2002). The JBS chart approximates age and 
systolic blood pressure, and the new Sheffield table dichotomises blood pressure, and these 
simplifications may lead to diagnostic inaccuracy. Methods: The diagnostic performance 
of the charts against an individualised laboratory based CHD risk calculation in 1 102 
10 subjects in primary care were evaluated and compared. The new Sheffield table and 
modified JBS chart performed equally well 

Most previously used models used to predict individual risk of death from coronary heart 
disease (CHD) were developed from data of three decades ago from the Framingham Heart 
1 5 Study. CHD mortality rates have declined markedly since that period as a result of 

improvement in both risk factor status and medical interventions. Generalization of the 
results from this one study to the population at large remains a matter of concern. Liao and 

CHD from Framingham and two more recent American cohorts, the First and Second 
20 National Health and Nutrition Examination Survey (NHANES I and NHANES IT). The 

participants included 1846 men and 2323 women 35 to 69 years of age and free of CHD at 
the fourth examination (1954 to 1958) from the Framingham Study; 2753 men and 3858 
women from the NHANES I (1971 to 1975); and 2655 men and 3050 women from 
NHANES II (1976 to 1980). The three cohorts were monitored for 24, 20, and 15 years, 
25 respectively. Significant heterogeneity existed among studies in the magnitude of the Cox 
coefficients for the individual factors (ie, age, systolic blood pressure, serum total 

■ 

cholesterol, and smoking status), especially among men. When risk factors were 
considered collectively, however, functions derived from and applied to different cohorts 
had a similar ability to rank individual risk. The areas under the receiver operating 
30 characteristic curves were 0. 7 1 to 0.76 in men and 0.76 to 0.8 1 in women when different 
risk functions were applied to their own population or to a second population. The 
cumulative CHD survival observed in women in the two cohorts was close to what was 
predicted from the Framingham equation. The authors concluded that the Framingham risk 
model for the prediction of CHD mortality rates provides a reasonable rank ordering of risk 
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for individuals in the US white population for the period 1975 to 1990. However, 
prediction of absolute risk is less accurate. 



SUMMARY OF THE INVENTION 

5 

The object of the present invention is a method of identifying the risk of cardiovascular 
diseases, especially MI and stroke, by detecting gene polymorphisms and other gene 
mutations from a biological sample of the subject. The information obtained from this 
method can be combined with other information concerning an individual, e.g. results from 

10 blood measurements, clinical examination and questionnaires. The genetic information 
includes data on mutations in genes associated with MI and/or stroke. The blood 
measurements include the determination of blood or plasma or serum analytes that predict 
CHD or stroke such as blood lipid, homocysteine, glucose, and insulin concentrations and 
urinary excretion of nicotine metabolites. The information to be collected by questionnaire 

15 includes information concerning gender, age, family and medical history and health-related 
habits such as smoking. Clinical information collected by examination includes e.g. 
information concerning height weight, hip anf waist circumference, systolic and diastolic 



uptake. 

20 

The invention particularly provides a method for diagnosing a susceptibility to 
cardiovascular disease especially myocardial infarction (MI) and stroke in a subject 
by detecting genetic variation or polymorphism, i.e. a mutation, in at least three of 
the genes selected from the group consisting of: 

25 (a) a2B-adrenoceptor 

(b) apolipoprotein B 

(c) dimethylarginine dimethylaminohydrolase 1 

(d) fibrinogen-beta 

(e) neuropeptide Y 

30 (f) natriuretic peptide precursor A 

(g) cystathione beta synthase 

(h) glycoprotein Ub/JRa 

(i) lipoprotein lipase 
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comprising the steps of: 

i) providing a biological sample of the subject to be tested, 

ii) detecting the presence of mutations in the genes, the presence of a mutation in one 

5 or several of the genes indicating an increased risk of coronary heart disease (CHD) and/or 
myocardial infarction (MI) in said subject 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS OF THE 
INVENTION 

10 

In a preferred embodiment the invention comprises the combination of information from a 
large number of variables (measurements) to predict the probability of MI and stroke. The 
predictor information includes an assessment of genotypes and haplotypes in genomic 
DNA and optionally data obtainable by interviews, questionnaires, clinical examination 
15 and/or blood analyte measurements. This predictor information can be collected in any age. 
This method is also applicable to middle-aged persons. 

Information concerning genomic DNA genotypes concerns polymorphisms such as single 
nucleotide polymorphisms (SNPs) and mutations in e.g. the following genes (OMIM 

20 abbreviations): APOA1, APOA2, APOA4, APOB, APOC1, APOC2, APOC3, APOC4, 
APOD, APOE, ARG, LDLR, OLR1, MSR1, MSR2 , LPA, LPL, LIPC, LIPG, CETP, ETL, 
GPHIa, ICAM1, ICAM2, ICAM3, SELL, SELE, MMP1, MMP3, ITGB n, ADD1, ADD2, 
ADD3, NPY, NPY1R, NPY2R, NPY3R, NPY4R, NPY5R, HFE1, HFE2, HFE3, TFRC, 
TFR2, PON1, PON2, SOD1, SOD2, SOD3, CAT, GSTM1, GSTM2, GSTM3, GSTP1, 

25 GPX1, GPX3, TNFA, TNFB, TRX, NOS3, NOS3, DDAH1, DDAH2, ADRB1, ADRB2, 
ADRB3, F2, F5, F7, F8, F13, VWF, PAH, PAI2, FGA, FGB, FGG, ACE, AGT, AGTR1, 
ATG, SCAP, SCNN1A, SCNN1B, NPPA, CBS, MTHFR, or any other candidate genes 
that will be observed to relate to the susceptibility to MI or stroke. 

30 The data that can be obtained by questionnaire, interview or clinical examination includes 
information concerning: 

1) age, 

2) gender, 
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3) medical history, i.e. prevalent diseases, 

4) family history, i.e. diseases of parents and siblings, 

5) tobacco smoking, 

6) alcohol use, 

5 7) physical activity and exercise, 

8) high weight or obesity in childhood and adolescence, 

9) personality traits such as depression, anxiety, hostility, 

1 0) psychological and mood states such as anger, irritability, 

1 1) low self-esteem or weak self-image, 

10 12) lack of social skills, social isolation, lack of social networks, 

13) self-image promoting alcohol use (e.g. easy-taking), 

14) adulthood socioeconomic circumstances (e.g. being single, divorced or widowed as 
the marital status, posessing no phone, low socioeconomic status, unemployment 
and urban place of residence, 

15 15) stressful life events, 

16) coping styles, coping capacity, anger control, 

17) history of diabetes, 



19) high amount of hospitalizations, poor health status, 
20 20) blood pressure, heart rate, maximal oxygen uptake, 

21) other relevant information that can be collected by self-administered questionnaire, 
by an interview or by clinical examination of fee subject. 

Information obtainable by measurements from blood, blood cell, plasma, serum or urine 
25 samples includes: 

1) serum or plasma cholesterol, HDL and LDL cholesterol, 

2) serum or plasma triglycerides, 

3) serum or plasma apolipoproteins, 

4) serum or plasma insulin concentration, 
30 5) blood or serum glucose concentration, 

6) blood hemoglobin concentration, 

7) serum ferritin or transferring receptor concentrations, 

8) serum fibrinogen and other coagulation factor concentration, 

9) measurement of platelet activation, aggregation and/or adhesion, 
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10) serum or plasma concentrations of inflammatory markers such as CRP, 

11) other relevant information that can be obtained by chemical or biochamical 
measurements. 



5 Numerous genotyping methods have been described in the art for analysing nucleic acids 
for the presence of specific sequence variations e.g. SNP's, insertions and deletions (for 
review see Syvanen 2001 and Nedelcheva Kristensen et aL 2001). In these methods a 
sample containing nucleic acid (e.g. blood, tissue biopsy or buccal cells) is obtained from 
the patient and the sequence variations of interest are identified and visualised from the 
10 nucleic acids. 



Allelic variants in genes can be discriminated by enzymatic methods (with the aid of 
restriction endonucleases, DNA polymerases, ligases etc.), by electrophoretic methods 
(e.g. single strand conformation polymorphism (SSCP), heteroduplex analysis, fragment 
1 5 analysis and DNA sequencing), by solid-phase assays (dot blots, microarrays, 

microparticles, microtiter plates etc.) and by physical methods (e.g. hybridisation analysis, 
mass spectrometry and denaturing high performance liquid chromatography (DHPLQ). In 

used both to increase the signal to noise ratio as well as spare sample nucleic acid before 
20 allele discrimination. Detectable labels (fluorochromes, radioactive labels, biotin, modified 
nucleotides, haptens etc) can be used to enhance visualization of allelic variants. 



This invention is based on the principle that a small number of genotypings are performed, 
and the mutations to be typed are selected on the basis of their ability to predict MI and/or 
25 stroke. For this reason any method to genotype mutations in a genomic DNA sample can 
be used. If non-parallel methods such as real-time PCR are used, the typings are done in a 
row. The PCR reactions may be multiplexed or carried out separately in a row or in 
parallel aliquots. 



30 



The score that predicts the probability of MI or stroke may be calculated using a 
multivariate failure time model or a logistic regression equation as follows: 
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Probability of a cardiovascular disease = [1 + e + £ < bi#?Q »] wherein e is Napier's 
constant, Xi are variables related to the cardiovascular disease, bi are coefficients of these 
variables in the logistic function, and a is the constant term in the logistic function. The 
model may additionally include any interaction (product) or terms of any variables Xi, e.g. 
5 biXj. An algorithm is developed for combining the information to yield a simple prediction 
of MI as percentage of risk in 10 years. An alternative statistical model is a failure-time 
model such as the Cox's proportional hazards' model. 

EXPERIMENTAL SECTION 
10 Determinin g individual genotypes with SNaPShot 

* 

* 

The method according to the invention for the determination of the allelic pattern of the 
codons/mutations in question can be carried out with polymerase chain reaction (PCR) in 
combination with, for example, an allele specific primer extension method (SNaPshot, 
15 Applied Biosystems) or DNA fragment analysis followed by capillary electrophoresis with 
ABI Prism 3 100 Genetic Analyzer (Applied Biosystems). 

In a SNaPshot reaction the genomic DNA region containing the mutation is question is 
amplified with PCR- The amplified PCR reaction is purified and the product is used as a 
20 template in SNaPshot reaction. 

For the SNaPshot reaction an extension primer that ends one nucleotide 5 ' of a given 
single nucleotide polymorphism (SNP) locus is designed. In the SNaPshot reaction the 
extension primer binds to its complementary template in the presence of fluorescent 
25 labelled dideoxy-NTPs ([FJddNTPs) and DNA polymerase. The polymerase extends the 
primer by only one nucleotide, adding a single [F]ddNTP to its 3' end In the analysed data 
nucleotide A is seen in green colour, C is seen in black colour, G is seen in blue colour and 
T in red colour. If for example the genotype is A/A then only green colour is detected. For 
a heterozygous A/C green and black colour are detected. 

30 

When multiple SNPs are determined in the same reaction, the extension primers need to 
differ significantly in length (4-6 nucleotides) to avoid overlap between the final SNaPshot 
products. This can be accomplished by adding a variable number of nucleotides dT, dA, 
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dC or cGATC to the 5' end of the different extension primers. The different SNPs can then 
be detected in the capillary electrophoresis according to the different size of the SNaPshot 
product. To perform SnaPshot genotyping under standard conditions, refer to the user 
manual (ABI Prism Snapshot Multiplex kit, Protocol, Applied Biosystems). 

5 

In the DNA fragment analysis, a fluorescent lable is attached to the 5' end of the PCR 
primer. In the DNA fragment analysis, the alleles of the locus to be genotyped are different 
in length (i.e. there is a deletion or an insertion of known number of nucleotides in the 
studied locus). The different alleles can then be detected after the capillary electrophoresis 
10 due to the different migration rates of the different lengths of the per product (i.e. alleles). 

Polymerase chain reaction (PCR) 

The genomic DNA regions containing the mutations in question can be amplified with 
15 PCR either in separate reactions or all in one single reaction mix (Le. multiplex PCR) with 
PTC-220 DNA Engine Dyad PCR machine (MJ Research). The PCR amplification was 

■ 

conducted in a 20 ^1 volume: the reaction mixture contained 60 ng human genomic DNA 

-s.vi.~- XT ' j. 

nucleotides (dATP, dCTP, dGTP, dTTP), 0.5 \xM of each primers and 1 unit of the DNA 
20 polymerase (QIAGEN, Hot Start Taq DNA polymerase). The PCR conditions need to be 
determined experimentally, and the following standard protocol can be used as a start: first 
the reaction was hold 10 minutes at 94°C, then the following three steps were repeated for 
35 cycles: 45 seconds at 94°C, 45 seconds at 55°C, 1 minute 30 seconds at 72°C, after 
which the reaction was kept at 72°C for an additional 5 minutes and finally hold at 4°C. 

25 

APOB Thr98Ile (also known as APOB Thr71Ile) 

The nucleotide sequence of the primer pair for the amplification of human APOB gene 
(apolipoprotein B gene, NM_000384 (http://www.ncbLnlm.nih.gov/)) Thr98Ile mutation 
(also known as Thr71Ile mutation) was as follow: 5'- GAC AAC CTC AAT GCT CTG CT 
30 -3 ■ (SEQ ID NO: 1) and 5 TGA CTT ACC TGG ACA TGG CT -3 9 (SEQ ID NO:2). 

NPPA Val32Met 

The nucleotide sequence of the primer pair for the amplification of human NPPA 
(natriuretic peptide precursor A gene, NM_006172) (gene is also known as ANF or ANP 
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or PND or Pronatriodilatin (atrial natriuretic peptide)) Val32Met mutation was as follow: 
5'- GCC AAG AGA GGG GAA CCA GAG -3' (SEQ ID NO:4) and 5 s - AGT GAG CAC 

■ 

AGC ATC AGA AAG C-3 ' (SEQ ID NO: 5). 

5 DDAH1 IVS2-330T 

The nucleotide sequence of the primer pair for the amplification of human DDAH1 
(dimethylarginine dimethylaminohydrolase 1, NM_012137) IVS2-33C>T mutation was as 
follows: 5'- ATC CTG CTT TCT GCC CTT T -3' (SEQ ID NO:7) and 5'- AAG CCA 
GTG AAG CGT AAA CAC-3' (SEQ ID NO:8). 

10 

FGB -^55G>A 

The nucleotide sequence of the primer pair for the amplification of human FGB gene 
(fibrinogen-beta gene, NM_005141) promoter mutation -455G>A mutation was as follow: 
5'- AAC ACA CAA GTG AAC AGA CAA G-3' (SEQ ID NO: 10) and 5'- GCA CTC 
15 CTC AAA GAG AGA TG -3' (SEQ ID NO:l 1). 

NPY -52 OG 

(neuropeptide Y gene, NMJ5009.05) -52 OG mutation was as follow: 5'- GTT CTC TCT 
20 GCG GGA CTG GG-3' and (SEQ ID NO:13) 5'- CTG CCC TGG GAT AGA GCG AA- 
3' (SEQ ID NO: 14). 

CBS Ile278Thr 

The nucleotide sequence of the primer pair for the amplification of human CBS gene 
(cystathionine-beta-synthase gene, NM_000071) Ile278Thr mutation was as follow: 5'- 
GAG CCT GGG TTC TTG GGT TTC -3' (SEQ ID NO:18) and 5'- GGT TGT CTG CTC 
CGT CTG GTT -3' (SEQ ID NO: 19). 

LPL Asn3 1 8Ser (also known as LPL Asn29 1 Ser mutation) 

The nucleotide sequence of the primer pair for the amplification of human LPL gene 
(lipoprotein lipase gene, NM_000237) Asn318Ser mutation (also known as LPL 
Asn291Ser mutation) was as follow: 5'- CGC TCC ATT CAT CTC TTC ATC G -3' (SEQ 
ID NO:21) and 5'- CCC CCT ATC AAC AGA AAC ACC A -3' (SEQ ID NO:22). 



WO 2004/031407 PCT/FI2003/000740 

11 

ITGB3 Leu59Pro falso known as Leu33Pro mutation) 

The nucleotide sequence of the primer pair for the amplification of human ITGB3 
(integrin, beta 3, (platelet glycoprotein ma, antigen CD61, NM_000212) Leu59Pro 
mutation (also known as Leu33Pro mutation) was as follow: 5 5 - GCA GGA GGT AGA 
5 GAG TCG CCA -3' (SEQ ID NO:24) and 5'- GGG CAC AGT TAT CCT TCA GCA-3' 
(SEQ ID NO:25). 

NPPA OPA152Arg 

The nucleotide sequence of the primer pair for the amplification of human NPPA 
10 (natriuretic peptide precursor A gene) (gene is also known as ANF or ANP or PND or 
Pronatriodilatin (atrial natriuretic peptide)) OPA152Arg mutation was as follow: 5'- TTA 
GCA GTT CAT ATT CCT CCC C -3' (SEQ ID NO:27) and 5 5 - AGC CTC TTG CAG 
TCT GTC CC -3' (SEQ ID NO:28). 

15 Tumor necrosis factor TNF or Tumor necrosis factor alpha (TNF A) or Cachectin 
Gene map locus 6p21.3. Tumor necrosis factor is a multifunctional proinflammatory 

function. Typed mutations were from the promoter sequence [ . . . G(-376)A, G(-308)A, 
20 G(-244)A, G(-238)A ...] 

Lvmphotoxin-alpha: LTA or Lvmphotoxin-A or Tumor necrosis factor beta (TNFB) 

Gene map locus 6p21.3. Lymphotoxin-alpha is a soluble protein secreted by activated 
lymphocytes and presumed to act as a modulator in the immune response. The LT-alpha 
shares its receptor with tumor necrosis factor and binds to both TNF receptor- 1 and -2. 
25 Typed mutation: Thr26Asn 

Factor V 

Typed mutation: Arg506Gln 
Factor VII 

Typed mutaion: Del/Ins, Arg353Glu 



30 
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Purification of the PCR products for SNaPshot reaction 

The PCR products were purified with SAP (Shrimp Alkalinen Phosphatase, USB 
5 Corporation) and Exol (Exonuclease I, USB Corporation) treatment. This was done to 
avoid the participation of the unincorporated dNTPs and primers from the PCR reaction to 
the subsequent primer-extension reaction. More specifically 5 units of SAP and 2 units of 
Exol were added to 15 jxl of the PCR product. Reaction was mixed and incubated at 37°C 
for 1 hour. After that the reaction was incubated at 75°C for 15 minutes to inactivate the 
10 enzymes and afterwards kept at 4°C. 

Primer extension reaction ( SNaPshot reaction) 

In the subsequent primer extension reaction (SNaPshot reaction) 5 jil of SNaPshot 
15 Multiplex Ready Reaction Mix (Applied Biosystems), 3 pi of purified PCR products, 1 jil 
of pooled extension primers (depending of the signal in the SNaPshot reaction, the primer 
concentrations in the mix can range between 0.05 \xM and 1 \xM) and 1 jxl water are mixed 

95°C for 5 s, 50°C for 5 s and 60°C for 5 s in a PTC-220 DNA Engine Dyad PCR machine 
20 (MJ Research). 

The nucleotide sequence of the extension primer for the genotyping of human APOB 
Thr71Ile mutation in a SNaPShot reaction was as follow: 5 s - TTT TTT TTT TTT TGA 
AGA CCA GCC AGT GCA -3' (SEQ ID NO:3). 

25 

The nucleotide sequence of the extension primer for the genotyping of human NPPA 
Val32Met mutation in a SNaPShot reaction was as follow: 5'- TT TTT TTT TTT TTT 

« 

TTT AAT CCC ATG TAC AAT GCC -3' (SEQ ID NO:6). 

30 The nucleotide sequence of the extension primer for the genotyping of the human DDAH1 
IVS2-330T mutation in a SNaPShot reaction was as follow: 5'- T TTT TTT TTT TTT 
TTT TTT TTT GTA CAG TCA CTG GTG CCA -3' (SEQ ID NO:9). 
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The nucleotide sequence of the extension primer for the genotyping of human FGB 
promoter -455G>A mutation in a SNaPshot reaction was as follow: 5'- TTT TTT TTT 
TTT TTT TTT TTT TTT TTT TTC TAT TTC AAA AGG GGC-3' (SEQ ID NO:12). 

The nucleotide sequence of the extension primer for the genotyping of human NPY gene - 
52 OG mutation in a SNaPShot reaction was as follow: 5'- T TTT TTT TTT TTT TTT 
TTT TTT TTT TTT TTT GAG GAG GGA GGT GCT GCG -3' (SEQ ID NO:15). 

The nucleotide sequence of the extension primer for the genotyping of human LPL 
Asn291 Ser mutation was as follow: 5'- TTT TTT TTT TTT TTT TTT TTT TTT TTT TTT 
TTT TTT TCTTTT GGC TCT GAC TTT A -3' (SEQ IDNO:23) 

The nucleotide sequence of the extension primer for the genotyping of human ITGB3 
Leu33Pro mutation was as follow: 5'- TT TTT TTT TTT TTT TTT TTT TTT TTT TTT 
TTT TTT TTT TTT GTC ACA GCG AGG TGA GCC C -3* (SEQ ID NO:26). 

The nucleotide sequence of the extension primer for the genotyping of human NPPA 

TTT TTT TTT TTT TTT TTT CTC CCT GGC TGT TAT CTT C -3' (SEQ ID NO:29). 
Post-extension treatment 

After the primer extension reaction 1 unit of SAP was added to the reaction mix and the 
reaction was incubated at 37°C ibr 1 hour. The enzyme was inactivated by incubating the 
reaction mix at 75°C for 15 minutes. Afterwards the samples were placed at 4°C. The 
post-extension treatment was done to prevent the unincorporated fluorescent ddNTPs 
obscuring the primer extension products (SNaPshot products) during electrophoresis with 
ABI Prism 3 100 Genetic Analyzer. 

DNA fragment analysis of ADRA2B insertion/deletion polymorphism 
ADRA2B insertion/deletion mutation 

ADRA2B gene (alpha2B-adrenergic receptor gene, NM_000682) insertion/deletion 
polymorphism was as follows 5- GGG TGT TTG TGG GGC ATC TC -3' (SEQ ID 
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NO:16) and 5'- TGG CAC TGC CTG GGG TTC A -3' (SEQ ED NO:17). A fluorescent 
label has been added to the 5' end of one of the above mentioned per primers. Thus, the per 
fragment is detectable in the capillary electrophoresis conducted with ABI Prism 3100 
Genetic Analyzer. 

The insertion/deletion polymorphism of ADRA2B gene concerns an insertion or an 
deletion of three glutamic acids in the region of 12 Glu aminoacids in the codons 298-309. 
Thus depending on the genotype, there is either 9 Glu (deletion) or 12 Glu (insertion) at the 
ADRA2B locus. Depending on whether the amplified allele had an insertion or a deletion 
in the studied locus, the size of the per product was 91 bp (insertion allele) or 82 bp 
(deletion allele). Thus, for homotzygotes (insertion/insertion or deletion/deletion) only one 
size of a fragment was detected either 91 bp or 82 bp, respectively. For heterotzygotes both 
of the above mentioned fragments were detected. 

Capillary electrophoresis with ABI Prism 3 100 Genetic Analyzer 

Aliquots of 1 pi of pooled SNaPshot products, 0.5-1.0 \xl of the ADRA2B 
insertion/deletion per product, 9.00 pi of Hi-Di formamide (Applied Biosystems) and 0.25 

3100 optical microamp plate (Applied Biosystems). The reactions were denatured by 
placing them at 95°C for 5 minutes and then loaded onto a ABI Prism 3100 Genetic 
Analyzer (Applied Biosystems). Electrophoresis data was processed and the genotypes 
were visualized by using the GeneScan Analysis version 3.7 (Applied Biosystems). 

Testing the Risk of MI and stroke 

■ 

Risk factors for MI and stroke were studied in the KEHD cohort. Briefly, the "Kuopio 
Ischaemic Heart Disease Risk Factor Study" (KIHD) is a prospective population study in 
men in Eastern Finland (Salonen 1988, Tuomainen et al. 1999). The study protocol for 
KEHD was approved by the Research Ethics Committee of the University of Kuopio. The 
study sample comprised men from Eastern Finland aged 42, 48, 54 or 60 years. A total of 
2682 men were examined during 1984-89. All participants gave a written informed 
consent The follow-up of coronary and cerebrovascular events was to the end of 2000, 
providing an average follow-up time of 13.4 years. Genotypings were carried out for 
approximately 1600 men, resulting to over 21,000 person-years of follow-up. 
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Of the baseline examination participants, 1038 men were re-examined approximately four 
years after the baseline survey, in 1991-3. The mean follow-up time was 4.1 years. Of 
1 177 eligible men, 139 could not be contacted or refused to participate, 1038 (88.2%) men 
5 participated. 

A nested case-control set was selected consisting of 47 men who developed a MI by the 
end of 2000 and 47 control men matched for age, place of residence, fasting time and 
examination day, who had no MI by the end of 2000. Both the cases and the controls had 
10 no MI prior to the 1991-3 examination. Similarly, a case-control set of 22 men who had a 
stroke during the follow-up and 22 identically matched controls were selected. Neither 
group had a previous stroke prior to the 1991-3 examination. A large number of 
genotypings were carried out in these nested case-control sets. 

15 Data on CHD and cerebrovascular disease during the follow-up were obtained by 
computer linkage to the national computerized hospital discharge registry. Diagnostic 
information was collected from the hospitals and all heart attacks and cerebrovascular 

of acute coronary events was based on symptoms, electrocardiographic findings, cardiac 
20 enzyme elevations, autopsy findings and the history of CHD. Each suspected coronary 
event (ICD-9 codes 410-414 and ICD-10 codes 120-125) was classified into 1) a definite 
acute myocardial infarction (AMI), 2) a probable AMI, 3) a typical acute chest pain 
episode of more than 20 minutes indicating CHD, 4) an ischemic cardiac arrest with 
successful resuscitation, 5) no acute coronary event or 6) an unclassifiable fatal case. The 
categories 1) to 3) were combined for the present analysis to denote MI. Cerebrovascular 
events were classified according to the FINNMONICA criteria. 

The purpose of this project was to develop a simple gene test that can be used to diagnose 
CHD and cerebrovascular disease and to predict the risk of acute myocardial infarction and 
stroke in healthy and sick persons. We had several data sets available to us for this work. 
The model was constructed in a prospective nested case-control set of 50 men who did not 
have prior MI but developed an MI during a 8-year follow-up, and 50 age-matched control 
men who did not develop ME during the follow-up. This case-control set was derived from 
the KIHD 1991-3 examination, in which over 1000 men aged 46-64 from Eastern Finland 
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years were examined (see ref. 4). We typed over 100 mutations assumed to be relevant 
regarding CHD and stroke in DNA samples obtained at baseline, and collected phenotypic 
information yielding over 5000 variables. 

Of the about 100 mutations, the four most predictive ones of MI were selected using 
hierarchial step-up binary logistic modelling (Table 1). These predicted 61% of future Mis 
(R square 16%). Theoretically (based on twin studies), this is the maximal prediction that 
can be achieved by genes. The second step was to find the most predictive other variables. 
We tested similarly over 1000 variables including all known risk factors for CHD. A set of 
six variables (Table 1) was defined that increased the prediction to 80% (R square 53%), 
and the predicted probability of MI for each person varied from 0.0002 to 0.9991. These 
can be recorded using five simple questions and measuring waist and hip circumferences. 
None of the over 200 biochemical measurements tested contributed much additional 
information to the model. The same concerned blood pressure and other clinical 
measurements. Age and gender are additionally needed in the model. 

We also constructed a 3-gene model which with four questionnaire variables predicted 

Thus, we invented a 10- variable model that predicted future myocardial infarction and a 7- 
variable model that predicted stroke very well in the data set they were derived of. The 
prediction of 80% is higher than in any published epidemiologic cohort study. An 
advantage is that only a small number of genotypings need to be carried out and a very 
short self-administered questionnaire needs to be filled in. One of the mutations in both 
tests is the same, so in total only six genotypings are needed to predict both ME and stroke. 
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1. A method for detecting genetic variation or polymorphism, Le. a mutation, in 
genes encoding: 

(a) a2B-adrenoceptor, and 

(b) apolipoprotein B ; 

and in at least one of the genes encoding: 

(c) dimethylarginine dimethylaminohydrolase 1 

(d) fibrinogen-beta 

(e) natriuretic peptide precursor A, and 

(f) neuropeptide Y 
comprising the steps of: 

ii) detecting the presence of mutations in the genes, the presence of a mutation in 
three or more of the genes indicating an increased risk of coronary heart disease 
(CHD) and/or myocardial infarction (MI) in said subject. 

2. The method according to claim 1, wherein a variation or polymorphism is further 
detected in at least one of the genes selected from the group consisting of: 

(g) cystathione beta synthase 
00 glycoprotein nb/EQa 

(i) lipoprotein lipase 

(j) tumor necrosis factor alpha (TNFA) 
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(k) coagulation factor 5 (F5) 

(1) coagulation factor 7 (FT), and 
(m) Lymphotoxin-alpha (LTA) 

5 

3. The method according to claim 1 or 2, wherein the detection step is a nucleic 
acid assay. 

4. The method according to claim 3, wherein the detection step is carried out 
using a gene or DNA chip, microarray, strip, panel or similar combination of 

10 more than one genes, mutations or RNA expressions to be assayed. 

5. The method according to claim 3, wherein the polymorphisms are determined 
using polymerase chain reaction. 

6. The method according to claim 1, wherein the biological sample is a blood 
sample or buccal swab sample, 

lnxurcoaaon concerning ugv 9 ^oaiuw, u±&±wAmy maiuiy <j± t*K*va^ u met* 
diseases and hypercholesterolemia, and the medical history concerning 
cardiovascular diseases of the subject with the results obtained from step ii) of 
the method for confirming the indication obtained from the detection step. 

20 8, The method according to claim 7, wherein said information is about 

hypercholesterolemia in the family, smoking status, CHD in the family, history of 
cardiovascular disease, obesity in the family, and waist-to-hip circumference ratio 
(cm/cm) or said information is about antihypertensive medication, smoking status, 
frequency of hangovers and body mass index. 

25 

9. The method according to claim 1, further comprising a step determining blood, 
serum or plasma cholesterol, HDL< cholesterol, LDL cholesterol, triglyceride, 
apolipoprotein B and AI, fibrinogen, ferritin, transferring receptor, C-reactive protein, 
serum concentration or plasma insulin concentration. 

30 
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10. Hie method according to claim 1, wherein the selected genes are natriuretic 
peptide precursor A, ct2B-adrenoceptor, apolipoprotein B and dimethylarginine 
dimethylaminohydrolase 1. 

11. The method according to claim 1, wherein the selected genes are fibrinogen-beta, 
(X2B-adrenoceptor, apolipoprotein B and neuropeptide Y. 

12. The method according to claim 1 further comprising a step of determining height, 
weight, systolic and diastolic blood pressure, heart rate, maximal oxygen uptake, or 
other electrocardiographic measurement of the subject. 

4 

13. The method according to claim 10, wherein the detected mutations are Val32Met 

of natriuretic peptide precursor A, an insertion/deletion of three glutamic acids in the 

region of 12 Glu aminoacids in the codons 298-309 of a2B-adrenoceptor, Thr9811e of 

apolipoprotein B and SNP IVS2-330T of dimethylarginine dimethylaminohydrolase 
1. 



455G>A of fibrinogen-beta, an insertion or deletion of three glutamic acids in the 
region of 12 Glu aminoacids in the codons 298-309 of a2B-adrenoceptor, and SNP - 
52C>G of neuropeptide Y. 

15. The method according to any one of the preceding claims further comprising a 
step of calculating the probability of a cardiovascular disease using a logistic 
regression equation as follows: 

Probability of a cardiovascular disease = [1 + e C-^ + Wx*))] - 1 % where e is Napier's 
constant, Xj are variables related to the cardiovascular disease, hi are coefficients of 
these variables in the logistic function, and a is the constant term in the logistic 
function. 



16. The method according to claim 15, wherein a and bj are determined in the 
population in which the method is to be used. 
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17. The method according to claim 15, wherein Xi are selected among the variables 
that have been measured in the population in which the method is to be used 

5 18. The method according to claim 15, wherein bi are between the values of -20 and 
20. 

19. The method according to claim 15, wherein Xj are binary variables that can have 
values or are coded as 0 (zero) or 1 (one). 

10 

20. The method according to claim 15, wherein i are between the values 0 (none) and 
100,000. 

21. A kit for diagnosing a susceptibility to a cardiovascular disease especially 
15 myocardial infarction (MI) and stroke in a subject, comprising means for 

detecting genetic variation or polymorphism, i.e. a mutation, in genes: 



(b) apolipoprotein B ; 
and in at least one of the genes: 
20 (c) dimethylarginine dimethylaminohydrolase 1 

(d) fibrinogen-beta 

(e) natriuretic peptide precursor A, and 

(f) neuropeptide Y 

and optionally software to interpret the results of the detection. 

25 22. The kit according to claim 21, comprising a DNA chip, microarray, DNA strip, 
DNA panel or real-time PGR based tests. 



23. The kit according to claim 21, comprising a questionnaire for obtaining patient 
information concerning age, gender, height, weight, the family history of 



im 
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cardiovascular diseases and hypercholesterolemia, the medical history concerning 
cardiovascular diseases. 
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SEQUENCE LISTING 

<110> Salonen, Jukka T 

Tuoma inen , Tomi - Pekka 
Pirskanen, Mia 

<120> Method for detecting the risk of cardiovascular diseases 
<160> 29 

<170> Patentln version 3.1 

<210> 1 
<211> 20 
<212> DNA 

<220> 

<221> ciisc_f eature 

<222> (1) . . (20) 

<223> APOB per primer F 

<400> 1 

gacaacctca atgetctget 2 0 

<210> 2 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 



1/13 
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<221> 



raise feature 



(1$ . ; 



(20) 



APOB per primer R 



<400> 2 

tgacttacct ggacatggct 



20 



<210> 3 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<221> mis cofeature 

<222> (1) . . (30) 

<223> APOB SNaPshot primer forward 

<40#> 3 

tttttttttt tttgaagacc agecagtgea 3 0 

<210> 4 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<221> mis cofeature 

<222> (1) . . (21) 

<223> NPPA per primer f 



<400> 4 

gecaagagag gggaaccaga g 



21 
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<210> 5 

<211> 22 

<212> DNA 

<213> Artificial 



<220> 

<221> misc_feature 

<222> (1)..(22) 

<223> NPPA per pricier r 



<400> 5 

agtgagcaca gcatcagaaa gc 22 

<210> 6 
<211> 35 
<212> DNA 

a.r-t-.if icial 



<220> 

<221> misc_f eature 

<222> (1) . . (35) 

<223> NPPA SNaPshot primer reverse 



<400> 6 

tttttttttt tttttttaat cccatgtaca atgee 3 5 

<21C> 7 

<211> 19 

<212> DNA 

<213> Artificial 



<220> 
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<221> 



misc feature 



<222> 



(1) . . (19) 



<223> 



DDAH1 IVS2-33C>T pre primer F 



<400> 7 

atectgettt ctgcccttt 



19 



<210> 6 

<212> 21 

<212> DMA 

<213> Artificial 

<220> 

<221> misc_f eature 

<222> (1) . . (21) 

<223> DDAH1 XVS2-33C>T pre primer r 

<4TO> U 

aagccagtga agegtaaaca c 21 

<210> 9 

<211> 40 

<212> DNA 

<213> Artificial 

<220> 

<221> misc_f eature 

<222> (1) . . (40) 

<223> DDAH1 IVS2-33C>T SNaPshot primer forward 



<400> 9 

tttttttttt tttttttttt ttgtacagtc actggtgcca 



40 



<210> 
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<211> 22 

<212> BNA 

<213> Artificial 



<220> 

<221> mis cofeature 

<222> (1) . . (22) 

<223> FGB -455G>A per primer F 

<400> 10 

aacacacaag tgaacagaca ag 

<210> 11 

<211> 20 

<212> DNA 

<213> Artificial 



<221> misc_f eature 
<222> (1)..(20) 

<223> FGB -455G>A per primer r 



<400> 11 

gcactcctca aagagagatg 

<210> 12 

<211> 45 

<212> DNA 

<213> Artificial 



<220> 

<221> misc feature 
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<222> (1)..(45) 

<223> FGB -455G>A SNaPshot oligo reverse 



<400> 12 

tttttttttt tttttttttt tttttttttc tatttcaaaa ggggc 45 

<210> 13 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<22±> mis cofeature 

<:222> (1)..(20) 

<223> NPY -52C>G per primer f 



<210> 14 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<221> raisc_f eature 

<222> (1) . . (20) 

<223> NPY -52C>G per primer r 



<400> 14 

ctgccctggg atagagegaa 20 
<210> 15 
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<211> 5C 

<212> DNA 

<213> Artificial 



<220> 

<221> misc_feature 

<222> (1) . . (50) 

<223> NPY -52C>G SNaPshot primer forward 



<400> 15 

tttttttttt tttttttttt tttttttttt ttgaggaggg aggtgctgcg 50 

a 

<210> 16 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<221> mis cofeature 

<222> (1) . . (20) 

<223> ADRA2B per primer f 



<400> 16 

gggtgtttgt ggggcatctc 20 

<210> 17 

<211> 19 

<212> DNA 

<213> Artificial 



<220> 

<221> misc feature 
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<222> (1)..(19) 

<223> ADRA2B per primer r 

■ 

<400> 17 

tggcactgcc tggggttca 19 

<2I0> 18 

<211> 21 

<212> DKA 

<213> Artificial 



<220> 

<221> misc_f eature 

<222> (1}..(21) 

<223> Description of Artificial sequence: FCR primer 
<220> 

<^^±> fflise_£ eature 

<222> (1) . . (21) 
<223> 



<400> 18 

gagcctgggt tcttgggttt c 

<210> 19 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<221> mi sc_f eature 

<222> (1>..(21) 

<223> CBS pre primer r 
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<220> 

<221> misc_f eature 

<222> (1) . . (21) 
<223> 



<400> 19 

ggttgtctgc tccgtctggt t 21 

<210> 20 

<211> 25 

<212> DKA 

<213> Artificial 



<220> 

<221> misc feature 



<223> snapshot primer cbs forward 



<40C> 20 

ttttttccgc gccctctgca gatca 25 

<210> 21 

<211> 22 

<212> DNA 

<213> Artificial 



<220> 

<221> uiisc_f eature 

<222> (1) . . (22) 

<223> IiPIi per primer P 
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<400> 21 

cgctccattc atctcttcat eg 

<210> 22 

<211> 22 

<212> DNA 

<213> Artificial 



<220> 

<221> misc_feature 

<222> (1)..(22) 

<223> LP!* per primer R 



<400> 22 

ccccctatca acagaaacac ca 

<210> 23 

<211> 55 

<z±^> Dim 

<213> Artificial 



<220> 

<221> misc_feature 

<222> (1)..(55) 
<223> 



<220> 

<221> rcisc_feature 

<222> (1) . . (19) 

<223> LPL SNaPShot primer 



22 



22 



<400> 23 

tttttttttt tttttttttt tttttttttt tttttttctt ttggctctga cttta 



55 
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<210> 24 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<221> misc__f eature 

<222> (1) . . (21) 

<223> ITGB3 per prirter F 

<400> 24 

gcaggaggta gagagtcgee a 

<210> 25 
<211> 21 
<212> DNA. 

<220> 

<221> misc_f eature 

<222> (1) . . (21) 

<223> ITGB3 per primer R 

<400> 25 

gggcacagtt atccttcagc a 

<210> 26 

<211> 60 

<212> DNA 

<213> Artificial 

<220> 
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740 



<221> 



misc feature 



<222> 



(1) . - (60) 



<223> 



ITGB3 SNaPshot primer reverse 



<400> 26 

tttttttttt tttttttttt tttttttttt tttttttttt tgtcacagcg aggtgagccc 



60 



<210> 27 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<221> raisc_f eature 

<222> (1) . . (22) 

<223> NPPA per primer F 

<400> 27 

ttagcagttc atattcctcc cc 22 

<210> 28 

<211> 20 

<212> DMA 

<213> Artificial 

<220> 

<221> ciisc_f eature 

<222> (1) . . (20) 

<223> NPPA per primer R 



<400> 28 

agcctcttgc agtctgtccc 



20 
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<210> 29 

<211> 65 ' 

<212> DNA 

<213> Artificial 



<220> 

<221> raisc_feature 

<222> (1) . . (65) 

<223> NPPA SNaPshot primer reverse 



<400> 29 

tttttttttt tttttttttt tttttttttt tttttttttt ttttttctcc ctggctgtta 
tcttc 
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